18 research outputs found

    Annotation of gene product function from high-throughput studies using the Gene Ontology.

    High-throughput studies constitute an essential and valued source of information for researchers. However, high-throughput experimental workflows are often complex, with multiple data sets that may contain large numbers of false positives. The representation of high-throughput data in the Gene Ontology (GO) therefore presents a challenging annotation problem, when the overarching goal of GO curation is to provide the most precise view of a gene's role in biology. To address this, representatives from annotation teams within the GO Consortium reviewed high-throughput data annotation practices. We present an annotation framework for high-throughput studies that will facilitate good standards in GO curation and, through the use of new high-throughput evidence codes, increase the visibility of these annotations to the research community.

    Evaluation of Intra-Host Variants of the Entire Hepatitis B Virus Genome

    Genetic analysis of hepatitis B virus (HBV) frequently involves study of intra-host variants, identification of which is commonly achieved using short regions of the HBV genome. However, the use of short sequences significantly limits evaluation of genetic relatedness among HBV strains. Although analysis of HBV complete genomes using genetic cloning has been developed, its application is highly labor intensive and practiced only infrequently. We describe here a novel approach to whole genome (WG) HBV quasispecies analysis based on end-point, limiting-dilution real-time PCR (EPLD-PCR) for amplification of single HBV genome variants, and their subsequent sequencing. EPLD-PCR was used to analyze WG quasispecies from serum samples of patients (n = 38) infected with HBV genotypes A, B, C, D, E and G. Phylogenetic analysis of the EPLD-isolated HBV-WG quasispecies showed the presence of mixed genotypes, recombinant variants and sub-populations of the virus. A critical observation was that HBV-WG consensus sequences obtained by direct sequencing of PCR fragments without EPLD are genetically close, but not always identical to the major HBV variants in the intra-host population, thus indicating that consensus sequences should be judiciously used in genetic analysis. Sequence-based studies of HBV WG quasispecies should afford a more accurate assessment of HBV evolution in various clinical and epidemiological settings.

    Spatial and Temporal Dynamics of Hepatitis B Virus D Genotype in Europe and the Mediterranean Basin

    Hepatitis B virus genotype D can be found in many parts of the world and is the most prevalent strain in south-eastern Europe, the Mediterranean Basin, the Middle East, and the Indian sub-continent. The epidemiological history of the D genotype and its subgenotypes is still obscure because of the scarcity of appropriate studies. We retrieved from public databases a total of 312 gene P sequences of HBV genotype D isolated in various countries throughout the world, and reconstructed the spatio-temporal evolutionary dynamics of the HBV-D epidemic using a Bayesian framework.

    Epidemic History and Evolutionary Dynamics of Hepatitis B Virus Infection in Two Remote Communities in Rural Nigeria

    BACKGROUND: In Nigeria, hepatitis B virus (HBV) infection has reached hyperendemic levels and its nature and origin have been described as a puzzle. In this study, we investigated the molecular epidemiology and epidemic history of HBV infection in two semi-isolated rural communities in North/Central Nigeria. It was expected that only a few, if any, HBV strains could have been introduced and effectively transmitted among these residents, reflecting limited contacts of these communities with the general population in the country. METHODS AND FINDINGS: Despite remoteness and isolation, approximately 11% of the entire population in these communities was HBV-DNA seropositive. Analyses of the S-gene sequences obtained from 55 HBV-seropositive individuals showed the circulation of 37 distinct HBV variants. These HBV isolates belong predominantly to genotype E (HBV/E) (n=53, 96.4%), with only 2 classified as sub-genotype A3 (HBV/A3). Phylogenetic analysis showed extensive intermixing between HBV/E variants identified in these communities and different countries in Africa. Quasispecies analysis of 22 HBV/E strains using end-point limiting-dilution real-time PCR, sequencing and median joining networks showed extensive intra-host heterogeneity and inter-host variant sharing. To investigate events that resulted in such remarkable HBV/E diversity, HBV full-size genome sequences were obtained from 47 HBV/E infected persons and P gene was subjected to Bayesian coalescent analysis. The time to the most recent common ancestor (tMRCA) for these HBV/E variants was estimated to be year 1952 (95% highest posterior density (95% HPD): 1927-1970). Using additional HBV/E sequences from other African countries, the tMRCA was estimated to be year 1948 (95% HPD: 1924-1966), indicating that HBV/E in these remote communities has a similar time of origin with multiple HBV/E variants broadly circulating in West/Central Africa. 
    Phylogenetic analysis and statistical neutrality tests suggested rapid HBV/E population expansion. Additionally, skyline plot analysis showed an increase in the size of the HBV/E-infected population over the last approximately 30-40 years. CONCLUSIONS: Our data suggest a massive introduction and relatively recent HBV/E expansion in the human population in Africa. Collectively, these data show a significant shift in the HBV/E epidemic dynamics in Africa over the last century.

    The Gene Ontology resource: enriching a GOld mine

    The Gene Ontology Consortium (GOC) provides the most comprehensive resource currently available for computable knowledge regarding the functions of genes and gene products. Here, we report the advances of the consortium over the past two years. The new GO-CAM annotation framework was notably improved, and we formalized the model with a computational schema to check and validate the rapidly increasing repository of 2838 GO-CAMs. In addition, we describe the impacts of several collaborations to refine GO and report a 10% increase in the number of GO annotations, a 25% increase in annotated gene products, and over 9,400 new scientific articles annotated. As the project matures, we continue our efforts to review older annotations in light of newer findings, and, to maintain consistency with other ontologies. As a result, 20 000 annotations derived from experimental data were reviewed, corresponding to 2.5% of experimental GO annotations. The website (http://geneontology.org) was redesigned for quick access to documentation, downloads and tools. To maintain an accurate resource and support traceability and reproducibility, we have made available a historical archive covering the past 15 years of GO data with a consistent format and file structure for both the ontology and annotations.